Abstract

We introduce a new test statistic for testing the null hypothesis that the sampling distribution
has an increasing hazard rate on a specified interval $[0,a]$. It is based on a comparison of
the empirical distribution function with an isotonic estimate, using the restriction that the
hazard is increasing, and measures the excursions of the empirical distribution above the isotonic
estimate, due to local non-monotonicity. It is proved in the companion paper Groeneboom and
Jongbloed (2011a) that the test statistic is asymptotically normal if the hazard is strictly
increasing on the interval $[0,a]$ and certain regularity conditions are satisfied. We discuss a
bootstrap method for computing the critical values and compare the test thus obtained with
other proposals in a simulation study.



1 Introduction 



In reliability theory and medical statistics, one is often interested in the lifetime distribution of
a certain subject, e.g. the distribution of the time it takes before the effect of a certain treatment
can be noticed or the time it takes before a system device breaks down. In such situations, it is
more natural to model the distribution in terms of its hazard rate (or failure rate) than in terms
of the distribution function or density function. Qualitative properties of the hazard rate can be
most easily interpreted. These reveal whether the device is subject to aging (in the case of an
increasing failure rate) or not.

Already in the 1960s, estimation of the behavior of the hazard rate, based on a sample from the
associated distribution, was studied intensively. The (nonparametric) maximum likelihood estimator
(MLE) for the hazard rate is described in, e.g., Barlow et al. (1972) and has properties somewhat
comparable with the Grenander estimator (MLE) of a decreasing density. Also, procedures were
developed to test the null hypothesis of a constant hazard rate (corresponding to an exponential
distribution) against the alternative of an increasing hazard (presence of aging). One popular test
statistic in this context is the Total Time on Test statistic of Proschan and Pyke (1967). This is a
scale invariant statistic, allowing for efficient computation of Monte Carlo-based critical values.

Only rather recently has the problem of testing the nonparametric null hypothesis that a hazard
rate is (locally) monotone, against the alternative that it is not, gained attention. In Gijbels
and Heckman (2004) local versions of the test statistic of Proschan and Pyke (1967) are studied.
In Durot (2008), the supremum distance between two estimators of the cumulative hazard rate
is introduced as test statistic. In both papers, critical values are obtained using the exponential
distribution, which lies 'on the boundary of the null hypothesis'. As will be seen in this paper,
this choice of the exponential distribution leads to conservative tests when the true underlying
distribution has a strictly convex cumulative hazard.

Hall and van Keilegom (2005) developed a test which "projects" the hazard on the space of 
nondecreasing hazards by performing a global smoothing of the hazard until it becomes nondecreas- 
ing, using kernel estimators. Their criterion for non-convexity of the (non-smoothed) cumulative 
hazard is compared with the same criterion for a bootstrap sample from the projected (smooth) 
hazard (this method has been called a "biased bootstrap"). The idea is that the criterion will be 
close to zero for bootstrap samples generated from the projected hazard, while the criterion will not 
be close to zero for the original sample, if the underlying hazard is not monotone. They compare 
the criterion in the original sample with, say, the 90th or 95th percentile of the distribution of the 
criterion in the bootstrap samples, and reject the hypothesis of monotonicity if the criterion in the 
original sample exceeds the chosen percentile of the distribution of the criterion in the bootstrap 
samples. There are some difficulties with this interesting idea, having to do with non-conservative 
behavior of this procedure. We will discuss this below. 

In this paper we propose another approach, where the type of projection is different from the
projection used by Hall and van Keilegom (2005). As in Durot (2008), our test statistic is a
distance between two estimators for the cumulative hazard: one under the local monotonicity
hypothesis and another, nonparametric, estimator that does not require this monotonicity. Our
distance measure is of integral type rather than the supremum distance considered in Durot (2008).
Our approach will be described in section 2. As in the method used by Hall and van Keilegom
(2005) and in a certain sense also the method used by Durot (2008), we will use a bootstrap
method for obtaining critical values. In generating the bootstrap samples, we use certain results in
Groeneboom and Jongbloed (2011b), and in the justification of this bootstrap method, we will
heavily rely on results in Groeneboom and Jongbloed (2011a). Section 3 contains a simulation
study of the various testing procedures, showing that the proposed test has rather good power,
without exhibiting the extreme anti-conservative behavior of the method proposed in
Hall and van Keilegom (2005). The appendix provides the proofs of certain results in section 2.



2 Setting and testing procedure 

Consider a sequence of i.i.d. random variables $X_1, X_2, \ldots$ with density function $f_0$ on
$[0,\infty)$. Denote the distribution function, hazard function and cumulative hazard function
associated with $f_0$ by $F_0$, $h_0$ and $H_0$ respectively and recall the relations between these
functions:

$$ h_0(x) = \frac{f_0(x)}{1 - F_0(x)}, \qquad H_0(x) = -\log(1 - F_0(x)), \qquad F_0(x) = 1 - \exp(-H_0(x)). \tag{2.1} $$

In this paper, we consider the problem of testing local monotonicity of $h_0$ (we restrict ourselves
to the increasing case; the case of a locally decreasing hazard can be considered analogously). More
precisely, given an interval $[a,b] \subset [0,\infty)$, we wish to test

$$ H_{[a,b]} : \ \forall x, y \in [a,b] \text{ with } x < y, \quad h_0(x) \le h_0(y) $$

against the alternative that this monotonicity does not hold. Our test statistic is defined as a
distance between two estimators for the cumulative hazard function: one that is constructed under
$H_{[a,b]}$ and one that is not.
An estimator for $H_0$ that does not assume $H_{[a,b]}$ is just the empirical cumulative hazard
function obtained by plugging the empirical distribution function $\mathbb{F}_n$ of the sample
$X_1, \ldots, X_n$ into (2.1):

$$ \mathbb{H}_n(x) = \begin{cases} -\log\{1 - \mathbb{F}_n(x)\}, & x \in [0, X_{(n)}), \\ \infty, & x \ge X_{(n)}. \end{cases} \tag{2.2} $$

Our estimator of $h_0$ under $H_{[a,b]}$ is the least squares estimator, minimizing the function

$$ h \mapsto \frac12 \int_{[a,b]} h(x)^2\,dx - \int_{[a,b]} h(x)\,d\mathbb{H}_n(x) \tag{2.3} $$

over all nondecreasing functions $h$ on $[a,b]$.

The solution of the problem of minimizing (2.3) under the null hypothesis can be constructed
explicitly. On $[a,b]$ it is given by the right-continuous derivative of the greatest convex minorant
(GCM) of the empirical cumulative hazard function $\mathbb{H}_n$ given by (2.2), restricted to
$[a,b]$. The estimator $\hat H_n$ of $H_0$ under the null hypothesis $H_{[a,b]}$ is therefore defined by

$$ \hat H_n(x) = \begin{cases} \mathbb{H}_n(x), & x \in [0,a) \cup [b,\infty), \\ \mathrm{GCM}\{\mathbb{H}_n(y) : a \le y \le b\}(x), & x \in [a,b]. \end{cases} \tag{2.4} $$

Note that this estimator is continuous at $a$ and, if $b \ne X_i$ for all $i$, at $b$.

Our test statistic for testing the null hypothesis of monotonicity of the hazard on the interval
$[a,b] \subset (0,\infty)$ is defined by

$$ T_n = \int_{[a,b]} \{\mathbb{F}_n(x-) - \hat F_n(x)\}\,d\mathbb{F}_n(x), \tag{2.5} $$

where $\hat F_n$ is the distribution function corresponding to $\hat H_n$:

$$ \hat F_n(x) = 1 - e^{-\hat H_n(x)}. $$

Note that $T_n \ge 0$, since $\hat H_n$ is the greatest convex minorant (hence a minorant) of
$\mathbb{H}_n$ on $[a,b]$. Also note that under the alternative hypothesis, $T_n$ will tend to be
larger than under the null hypothesis.
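
As a rough illustration of definition (2.5), the following Python sketch computes $T_n$ for a sample
without ties. It is not taken from the paper: the function names are hypothetical, the greatest
convex minorant is taken over the points $(X_{(i)}, \mathbb{H}_n(X_{(i)}-))$ with $a < X_{(i)} < b$
together with the two endpoint values, and the sketch assumes that at least one observation lies to
the right of $b$, so that all cumulative hazard values involved are finite.

```python
import bisect
import math


def gcm_vertices(points):
    """Vertices of the greatest convex minorant of points sorted by x."""
    hull = []
    for px, py in points:
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # Drop the last vertex if it lies on or above the chord
            # from hull[-2] to the new point.
            if (y2 - y1) * (px - x1) >= (py - y1) * (x2 - x1):
                hull.pop()
            else:
                break
        hull.append((px, py))
    return hull


def gcm_value(hull, x):
    """Linear interpolation of the convex minorant at x."""
    xs = [vx for vx, _ in hull]
    j = min(max(bisect.bisect_right(xs, x), 1), len(hull) - 1)
    (x1, y1), (x2, y2) = hull[j - 1], hull[j]
    return y1 + (y2 - y1) * (x - x1) / (x2 - x1)


def isotonic_test_statistic(sample, a, b):
    """Sketch of T_n in (2.5) on [a, b] (assumes no ties, max(sample) > b)."""
    xs = sorted(sample)
    n = len(xs)
    if xs[-1] <= b:
        raise ValueError("sketch assumes an observation to the right of b")

    def cum_hazard_left(x):
        # H_n(x-) = -log(1 - F_n(x-)), with F_n(x-) = #{X_i < x} / n
        return -math.log(1.0 - bisect.bisect_left(xs, x) / n)

    inside = [x for x in xs if a < x < b]
    points = [(a, -math.log(1.0 - bisect.bisect_right(xs, a) / n))]
    points += [(x, cum_hazard_left(x)) for x in inside]
    points.append((b, cum_hazard_left(b)))
    hull = gcm_vertices(points)

    total = 0.0
    for x in inside:
        f_left = bisect.bisect_left(xs, x) / n          # F_n(x-)
        f_hat = 1.0 - math.exp(-gcm_value(hull, x))     # isotonic estimate
        total += f_left - f_hat
    return total / n
```

When the points of the empirical cumulative hazard already form a convex curve, the minorant
coincides with them and the statistic is zero; a local violation of convexity produces a positive
value.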

To illustrate the behavior of the estimator we introduce the family of hazards
$\{h^{(d)} : d \in [-1,1]\}$, also considered in Hall and van Keilegom (2005):

$$ h^{(d)}(x) = \tfrac12 + \tfrac52\left\{\left(x - \tfrac34\right)^3 + \left(\tfrac34\right)^3\right\} + d x^2, \qquad x \ge 0. \tag{2.6} $$

The corresponding distribution functions are given by:

$$ F^{(d)}(x) = 1 - \exp\left\{-\tfrac12 x - \tfrac52\left\{\tfrac14\left(x - \tfrac34\right)^4 + \left(\tfrac34\right)^3 x\right\} - \tfrac13 d x^3 + \tfrac58\left(\tfrac34\right)^4\right\}, \qquad x \ge 0. \tag{2.7} $$

If $d > 0$ we get a strictly increasing hazard; if $d < 0$, the hazard is decreasing on an interval
containing $x = 3/4$; and if $d = 0$ the hazard has a stationary point at $x = 3/4$. See Figure 1 for
some hazards in this family.

Figure 1: The hazard functions $h^{(d)}$ for $d = -1, -0.75, -0.50, -0.25$ (dashed), $d = 0$ (full
curve) and $d = 0.25, 0.50, 0.75, 1$ (dotted), corresponding to the distribution functions (2.7). The
stationary points are shown by the red dots.

Remark 2.1 Note that we need the constant $\tfrac58\left(\tfrac34\right)^4$ in the exponent to make
the distribution function zero at the left endpoint 0, but that this constant is missing in the
formula given below (4.1) on p. 1121 in Hall and van Keilegom (2005).

The rather different nature of our isotonic projection of the hazard rate and the projection of
Hall and van Keilegom (2005) is illustrated in the left panel of Figure 2, where $d = -1$. Their
hazard estimate, given by the blue curve in Figure 2, extends (with positive values) to the left of
zero and has a slower increase to the right of 2.0 than the actual hazard, which is given by the black
curve (which is clearly not monotone). The isotonic projection, on the other hand, only lives on
$[0,\infty)$, and follows the steep increase of the real hazard to the right of 2.0, whereas it only
locally corrects for the non-monotonicity. The interval on which the hazard was estimated (and made
monotone) was $[0, F^{-1}(0.95)) \approx [0, 2.31165)$, where $F = F^{(d)}$ in (2.7) with $d = -1$.

On the other hand, if we are at the other end of the family $\{h^{(d)} : d \in [-1,1]\}$, at $d = 1$,
and therefore "deep inside the null hypothesis region", so to speak, the starting bandwidth for the
calibration of the Hall and van Keilegom (2005) method immediately gives an increasing hazard
on the interval $[0, F^{-1}(0.95)) \approx [0, 1.39778)$, where $F = F^{(d)}$ with $d = 1$, and the
projections of the two methods are less different, see the right panel in Figure 2.
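
The two interval endpoints quoted above can be checked numerically. The sketch below is an
illustration only: it assumes the coefficients $\tfrac12$, $\tfrac52$ and $\tfrac34$ in the hazard
(2.6), integrates the hazard in closed form to obtain the cumulative hazard in (2.7), and inverts it
by bisection. The function names and the bracketing value `upper` are hypothetical choices.

```python
import math


def hazard(x, d):
    """Hazard h^(d)(x) of (2.6), with the coefficients assumed above."""
    return 0.5 + 2.5 * ((x - 0.75) ** 3 + 0.75 ** 3) + d * x ** 2


def cum_hazard(x, d):
    """Integral of the hazard over [0, x], i.e. -log(1 - F^(d)(x)) in (2.7)."""
    return (0.5 * x
            + 2.5 * (0.25 * (x - 0.75) ** 4 + 0.75 ** 3 * x)
            + d * x ** 3 / 3.0
            - 0.625 * 0.75 ** 4)   # constant that makes cum_hazard(0, d) = 0


def quantile(p, d, upper=10.0, tol=1e-10):
    """Solve F^(d)(x) = p by bisection on the increasing cumulative hazard."""
    target = -math.log(1.0 - p)
    lo, hi = 0.0, upper
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cum_hazard(mid, d) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


right_end_d_minus_1 = quantile(0.95, -1.0)   # compare with 2.31165 in the text
right_end_d_plus_1 = quantile(0.95, 1.0)     # compare with 1.39778 in the text
```

Under these assumed coefficients the computed 95% quantiles agree with the values 2.31165 and
1.39778 given in the text, which also confirms the role of the constant discussed in Remark 2.1.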

In order to obtain critical values for the statistic $T_n$, there are various possible approaches. The
first is to use that its distribution under $H \in H_{[a,b]}$ is stochastically bounded by its
distribution under the distribution function with the cumulative hazard function that is obtained by
linear interpolation on the interval $(a,b)$:

$$ H_{a,b}(x) = \begin{cases} \dfrac{H(b)(x-a) + H(a)(b-x)}{b-a}, & x \in (a,b), \\ H(x), & x \notin (a,b). \end{cases} $$

Figure 2: The real hazard function $h^{(d)}$ (black), the isotonic estimate $\hat h_n$ of the hazard
(red), and the Hall and van Keilegom estimate (blue) of the hazard (after calibration), for a sample
of size $n = 1000$ from the distribution function $F^{(d)}$. The left panel corresponds to $d = -1$,
the right panel to $d = 1$.

Lemma 2.1 For each $H \in H_{[a,b]}$,

$$ P_H\{T_n > t\} \le P_{H_{a,b}}\{T_n > t\} \tag{2.8} $$

for all $t > 0$.



Remark 2.2 The proof of Lemma 2.1 (given in the appendix) reveals that the stochastic ordering
result also holds if, in the definition of $T_n$, $\{\mathbb{F}_n(x-) - \hat F_n(x)\}^p$ is used for
some $1 < p < \infty$ rather than for $p = 1$. Lemma 2.1 is related to the approximation in Durot
(2008), section III. If $H(a)$ and $H(b)$ were known, the distribution of $T_n$ under $H_{a,b}$ could
be approximated efficiently using Monte Carlo simulation. In practice, however, $H(a)$ and $H(b)$ are
unknown. In order to really use the approximation, estimates for $H_0$ at $a$ and $b$ are needed. In
Durot (2008) this estimation, combined with the stochastic domination of Lemma 2.1, is called the
bootstrap.

It is clear that if the function $H_0$ is strictly convex on $[a,b]$, the bound of Lemma 2.1
may be quite rough. Also, overestimation of the interval $[H_0(a), H_0(b)]$ will lead to a rough
bound. In the case of strict convexity of $H_0$ on $[a,b]$, the convex minorant of its empirical
version will tend to wrap tightly around this version, whereas if the cumulative hazard is linear on
$[a,b]$ (as in the exponential case), the difference between the two will tend to be bigger. The
following theorem, proved in Groeneboom and Jongbloed (2011a), describes the asymptotic behavior of
$T_n$, if the underlying hazard is strictly increasing on $[a,b]$.
Theorem 2.1 Let $h_0$ be strictly increasing and positive on the interval $I = [a,b] \subset
[0,\infty)$, with a bounded continuous derivative, staying away from zero on $I$. Moreover, let
$\zeta(t)$ be the distance at $t$ of the process

$$ W(x) + x^2, \qquad x \in \mathbb{R}, $$

to its greatest convex minorant, where $W$ is two-sided Brownian motion, originating from zero.
Then, for $T_n$ as defined in (2.5),

$$ n^{5/6}\{T_n - E T_n\} \stackrel{\mathcal{D}}{\longrightarrow} N(0, \sigma_{F_0}^2), $$

where $N(0, \sigma_{F_0}^2)$ is a normal distribution with mean zero and variance $\sigma_{F_0}^2$,
and where

$$ E T_n \sim n^{-2/3}\, E\zeta(0) \int_a^b \left(\frac{2 h_0(t) f_0(t)}{h_0'(t)}\right)^{1/3} dF_0(t), \qquad n \to \infty, \tag{2.9} $$

and

$$ \sigma_{F_0}^2 = 2\int_0^\infty \mathrm{covar}(\zeta(0), \zeta(s))\,ds \int_a^b \left(\frac{2 h_0(t) f_0(t)}{h_0'(t)}\right)^{4/3} dF_0(t). \tag{2.10} $$

If we want to test the hypothesis that the hazard is strictly increasing on $[a,b]$, we could try to
estimate the parameters $\mu$ and $\sigma$ of Theorem 2.1 (the integrals in (2.9) and (2.10)) and use
the limiting normal distribution for the critical values. The difficulty with this approach is that
it cannot be used if the derivative of $h_0$ is zero, as is the case, for example, if the underlying
distribution is the exponential distribution. For the latter situation we have the following result,
proved in Groeneboom and Jongbloed (2011a).

Theorem 2.2 Let U be given by 

where W is standard Brownian motion on [0, oo) and C is the greatest convex minorant of 

Suppose that the underlying hazard ho is constant on [0, a] . Then: 

So we see that in this situation the rate of convergence drops from $n^{5/6}$ to $n^{1/2}$, and also
that the limit distribution is no longer normal. In the case of the family $\{h^{(d)}\}$, we therefore
enter a completely different regime of asymptotic behavior when the parameter $d$ passes zero. The
most natural way to deal with this difficulty seems to us to be a bootstrap procedure, which we will
now describe. We will only prove that our bootstrap method works under the conditions of Theorem
2.1. We conjecture, however, that our method will also work under the conditions of Theorem 2.2
(possibly with a slightly modified version of the estimator of the cumulative hazard), but this is
still an open question.

The proposed method runs as follows. First estimate the cumulative hazard function under the
null hypothesis by a smooth estimator, having the property that the corresponding hazard satisfies
the null hypothesis. Then draw $B$ samples of size $n$ from this estimate and compute for each of them
the bootstrap version of the test statistic: $T^*_{n,i}$, $1 \le i \le B$. Finally, approximate the
distribution of $T_n$ under the true cumulative hazard function $H_0$ (assumed to belong to
$H_{[a,b]}$) by the empirical distribution of these bootstrap values and its critical value at (for
example) level 10% by the 90th percentile of this generated set of bootstrap values. In fact, Hall
and van Keilegom (2005) also use a bootstrap procedure of this type, but based on a totally different
"projection" of the hazard estimate on the set of increasing hazards.
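
The resampling step itself is generic and can be sketched as follows. This is a minimal
illustration under stated assumptions, not the authors' implementation: `bootstrap_critical_value`,
`statistic` and `draw_sample` are hypothetical names, the statistic and the sampler are passed in as
functions, and the percentile is taken as the corresponding order statistic of the bootstrap values.

```python
import random


def bootstrap_critical_value(statistic, draw_sample, n, num_boot, percent, rng):
    """Order statistic of bootstrap values at the given percentile.

    statistic   -- function mapping a sample (list of floats) to a number
    draw_sample -- function (n, rng) -> sample drawn from the null estimate
    percent     -- integer percentile, e.g. 90 for a test at level 10%
    """
    values = sorted(statistic(draw_sample(n, rng)) for _ in range(num_boot))
    # Smallest index k with (k + 1) / num_boot >= percent / 100.
    index = -(-percent * num_boot // 100) - 1
    return values[max(index, 0)]


# Toy usage: the "statistic" is the sample maximum and the null estimate is
# a standard exponential distribution; both are placeholders.
rng = random.Random(0)
critical_value = bootstrap_critical_value(
    statistic=max,
    draw_sample=lambda n, r: [r.expovariate(1.0) for _ in range(n)],
    n=50,
    num_boot=200,
    percent=90,
    rng=rng,
)
```

The null hypothesis would then be rejected when the statistic computed from the original sample
exceeds `critical_value`.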

In the further description of the method, we take the left endpoint of the interval at the origin
and denote the right endpoint by $a$, as in Groeneboom and Jongbloed (2011a). In order to prevent
inconsistency of $\hat h_n$ at the endpoints, we define a penalized version $\hat h_n^{(P)}$ of
$\hat h_n$ as the derivative of the greatest convex minorant of the penalized cusum diagram
consisting of the points

$$ (0,0), \qquad \left(X_{(i)},\ \mathbb{H}_n(X_{(i)}-) + 2n^{-2/3}\right),\ X_{(i)} < a, \qquad (a, \mathbb{H}_n(a-)). \tag{2.12} $$

The left derivative of the greatest convex minorant of this cusum diagram minimizes the criterion

$$ \frac12 \int_0^a h(x)^2\,dx - \int_{[0,a]} h(x)\,d\mathbb{H}_n(x) - \alpha_n h(0) + \beta_n h(a), \tag{2.13} $$

where $\alpha_n = \beta_n = 2n^{-2/3}$, over all nondecreasing functions $h$ on $[0,a]$. The reason
for choosing a penalty of order $cn^{-2/3}$ is explained in Groeneboom and Jongbloed (2011b), where a
proof of the consistency of this estimator at the boundary points is also given.
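
A possible way to compute such a penalized isotonic hazard estimate is sketched below. It is an
illustration under assumptions, not the authors' code: the sample is assumed to be positive and
without ties, with at least one observation to the right of $a$; the function names are hypothetical;
and the estimate is returned as the list of slopes of the greatest convex minorant of the cusum
diagram (2.12), with penalty $2n^{-2/3}$.

```python
import math


def gcm_vertices(points):
    """Vertices of the greatest convex minorant of points sorted by x."""
    hull = []
    for px, py in points:
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # Drop the last vertex if it lies on or above the new chord.
            if (y2 - y1) * (px - x1) >= (py - y1) * (x2 - x1):
                hull.pop()
            else:
                break
        hull.append((px, py))
    return hull


def penalized_isotonic_hazard(sample, a):
    """Slopes of the GCM of the penalized cusum diagram on [0, a].

    Returns a list of (left, right, slope) pieces; the slope on (left, right]
    plays the role of the penalized isotonic hazard estimate there.
    """
    xs = sorted(sample)
    n = len(xs)
    if xs[0] <= 0.0 or xs[-1] <= a:
        raise ValueError("sketch assumes a positive sample with max > a")
    penalty = 2.0 * n ** (-2.0 / 3.0)
    inside = [x for x in xs if x < a]

    points = [(0.0, 0.0)]
    for i, x in enumerate(inside):
        # i observations lie strictly below x, so H_n(x-) = -log(1 - i/n).
        points.append((x, -math.log(1.0 - i / n) + penalty))
    points.append((a, -math.log(1.0 - len(inside) / n)))   # (a, H_n(a-))

    hull = gcm_vertices(points)
    return [(x1, x2, (y2 - y1) / (x2 - x1))
            for (x1, y1), (x2, y2) in zip(hull, hull[1:])]
```

Because the interior points are shifted upward while the two end points are not, the first and last
segments of the minorant are pulled toward the interior, which is the intended effect of the
penalty terms in (2.13).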

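For $x \in [0,a]$, we estimate the hazard by kernel smoothing of $\hat h_n^{(P)}$. Let $K$ be the
triweight kernel

$$ K(u) = \tfrac{35}{32}\,(1 - u^2)^3\, 1_{[-1,1]}(u), \qquad u \in \mathbb{R}. \tag{2.14} $$

This is a mean zero probability density with second moment 1/9. Then, define for bandwidth
$b_n > 0$

$$ \tilde h_n(x) = \int K_{b_n}(x - y)\,d\hat H_n^{(P)}(y) = \int K_{b_n}(x - y)\,\hat h_n^{(P)}(y)\,dy, \tag{2.15} $$

where $K_{b_n}(u) = K(u/b_n)/b_n$ and $\hat H_n^{(P)}$ is the greatest convex minorant of the cusum
diagram (2.12). Equation (2.15) can then be written as

$$ \begin{aligned} \tilde h_n(x) &= \int K_{b_n}(x - y) \int_0^y d\hat h_n^{(P)}(u)\,dy = \iint_{u < y} K_{b_n}(x - y)\,dy\,d\hat h_n^{(P)}(u) \\ &= \int_{u=0}^{x+b_n} \int_{y=u}^{x+b_n} K_{b_n}(x - y)\,dy\,d\hat h_n^{(P)}(u) = \int_{u=0}^{x+b_n} \mathbb{K}\left(\frac{x-u}{b_n}\right) d\hat h_n^{(P)}(u), \end{aligned} $$

where

$$ \mathbb{K}(u) = \int_{-\infty}^u K(w)\,dw = 1_{[-1,1)}(u) \int_{-1}^u K(w)\,dw + 1_{[1,\infty)}(u). $$

The corresponding estimates of $h_0'$ and $H_0$ are then given by

$$ \tilde h_n'(x) = \int K_{b_n}(x - y)\,d\hat h_n^{(P)}(y) \quad \text{and} \quad \tilde H_n(x) = \int_0^x \tilde h_n(u)\,du. \tag{2.16} $$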
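
The smoothing step can be illustrated with a short sketch. It assumes the triweight kernel
$K(u) = \tfrac{35}{32}(1-u^2)^3$ on $[-1,1]$ and represents the piecewise constant isotonic hazard by
its jumps, given as hypothetical `(location, size)` pairs (a jump at location 0 carries the value of
the hazard at the origin). The smoothed hazard is then the sum of the jump sizes weighted by the
integrated kernel, in line with the last expression of the display following (2.15).

```python
def triweight(u):
    """Triweight kernel, a probability density on [-1, 1]."""
    return 35.0 / 32.0 * (1.0 - u * u) ** 3 if abs(u) <= 1.0 else 0.0


def triweight_cdf(u):
    """Integrated kernel: integral of the triweight kernel over (-inf, u]."""
    if u <= -1.0:
        return 0.0
    if u >= 1.0:
        return 1.0
    # Antiderivative of (1 - w^2)^3 is w - w^3 + 3 w^5 / 5 - w^7 / 7.
    return 0.5 + 35.0 / 32.0 * (u - u ** 3 + 0.6 * u ** 5 - u ** 7 / 7.0)


def smooth_step_hazard(x, jumps, bandwidth):
    """Kernel-smoothed hazard at x for a step hazard with the given jumps."""
    return sum(size * triweight_cdf((x - loc) / bandwidth) for loc, size in jumps)


# Toy usage: a hazard equal to 0.5 on [0, 1) and 1.5 from 1 onward.
example_jumps = [(0.0, 0.5), (1.0, 1.0)]
smoothed_at_one = smooth_step_hazard(1.0, example_jumps, bandwidth=0.3)
```

Since the integrated kernel is nondecreasing and all jump sizes of an isotonic estimate are
nonnegative, the smoothed hazard computed this way is again nondecreasing, so it stays inside the
null hypothesis.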

Figure 3: The estimate $\tilde h_n$ (blue) of the hazard $h^{(d)}$ (black) of the family
$\{h^{(d)} : d \in [-1,1]\}$ for a sample of size $n = 100$, together with the (penalized with
$2n^{-2/3} \approx 0.093$) isotonic estimate $\hat h_n^{(P)}$ (red) on the 95% percentile interval
$[0, [F^{(d)}]^{-1}(0.95)]$. Bandwidth $b_n = n^{-1/4} \approx 0.316$.

In justifying this method for testing that the hazard is strictly increasing on $[0,a]$, we use the
following bootstrap version of Theorem 2.1, which will be proved in section 4.

Theorem 2.3 Let the conditions of Theorem 2.1 be satisfied, and let $\tilde H_n$ be the estimate of
the cumulative hazard function under the null hypothesis, defined by (2.16), and based on a sample
$X_1, \ldots, X_n$ from $F_0$, where we take a vanishing bandwidth $b_n$, satisfying
$b_n \ge n^{-1/4}$. Let $X_1^*, \ldots, X_n^*$ be a bootstrap sample generated by $\tilde H_n$ and let
$\mathbb{F}_n^*$ and $\hat F_n^*$ be the (bootstrap) empirical distribution function and the
corresponding estimate, based on the greatest convex minorant of the function
$x \mapsto -\log(1 - \mathbb{F}_n^*(x-))$, respectively. Finally, let $T_n^*$ be defined by

$$ T_n^* = \int_{[0,a]} \{\mathbb{F}_n^*(x-) - \hat F_n^*(x)\}\,d\mathbb{F}_n^*(x), $$

and let its (bootstrap) expectation be defined by 




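where $\tilde F_n(x) = 1 - \exp\{-\tilde H_n(x)\}$. Then we have, almost surely,

$$ n^{5/6}\{T_n^* - E^* T_n^*\} \mid X_1, \ldots, X_n \ \stackrel{\mathcal{D}}{\longrightarrow}\ N(0, \sigma_{F_0}^2), $$

as $n \to \infty$, where $\sigma_{F_0}^2$ is given in Theorem 2.1.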

3 A simulation study 

In this section we compare the power behavior of the test based on our test statistic $T_n$, defined
by (2.5), with other test statistics for the families that were also considered in Hall and van
Keilegom (2005). The bootstrap resampling for $T_n$ was done by taking $B = 2000$ samples from the
estimate $\tilde H_n$ defined in (2.16) with bandwidth $b_n = n^{-1/4}$. For the estimator
$\hat h_n^{(P)}$ on which $\tilde H_n$ is based (see (2.12)), the penalty was taken equal to
$2n^{-2/3}$. Each bootstrap sample was generated by first generating a standard exponential sample
$E_1, \ldots, E_n$, and then producing the bootstrap sample via

$$ X_i^* = \tilde H_n^{-1}(E_i), \qquad 1 \le i \le n. $$

In this way, $B$ values $T_n^*$ were obtained. The critical value is taken to be the 90th percentile
of these values of $T_n^*$.
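
The sampling device used here rests on the fact that, for a continuous increasing cumulative hazard
$H$ and a standard exponential variable $E$, the variable $H^{-1}(E)$ has survival function
$\exp(-H(x))$. A minimal sketch follows, with a hypothetical numerical inversion by bisection; the
cumulative hazard is passed in as a function that is assumed to be increasing, unbounded and zero at
the origin.

```python
import random


def invert_increasing(func, target, tol=1e-10):
    """Solve func(x) = target for x >= 0 by bisection."""
    lo, hi = 0.0, 1.0
    while func(hi) < target:          # expand the bracket until it holds the root
        lo, hi = hi, 2.0 * hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if func(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def sample_from_cum_hazard(cum_hazard, n, rng):
    """Draw n observations with cumulative hazard cum_hazard via X = H^{-1}(E)."""
    return [invert_increasing(cum_hazard, rng.expovariate(1.0)) for _ in range(n)]


# Toy usage with H(x) = x^2 (a Weibull distribution), as a stand-in for the
# smoothed estimate of the cumulative hazard.
rng = random.Random(2011)
bootstrap_sample = sample_from_cum_hazard(lambda x: x * x, 5, rng)
```

In an actual implementation the inversion could exploit the explicit form of the smoothed
cumulative hazard; bisection is used here only because it needs no further assumptions.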

Below we also make a comparison with a test proposed in Durot (2008), referred to as the Durot
test in the sequel. This test is based on the supremum distance between the empirical cumulative
hazard function and its greatest convex minorant:

$$ T_{n,\mathrm{Durot}} = \sup_{x \in [0,a]} \{\mathbb{H}_n(x) - \hat H_n(x)\}. $$

For determining a critical value, again $B = 2000$ random standard exponential samples were
generated, and the value

$$ T^*_{n,\mathrm{Durot}} = \sup_{x \in [0, \mathbb{H}_n(a)]} \{\mathbb{H}_n^*(x) - \hat H_n^*(x)\} $$

was determined for each such "bootstrap" sample (taking the interval $[0, \mathbb{H}_n(a)]$ as
interval of convexity). The critical value was then taken to be the 90th percentile of the values of
$T^*_{n,\mathrm{Durot}}$ so obtained. Note that this procedure is equivalent to the procedure that
first estimates the (constant) hazard rate on $[0,a]$ by $\mathbb{H}_n(a)/a$, then takes bootstrap
samples from the exponential distribution with this hazard rate and finally determines the supremum
distance between the two resulting estimators on the interval $[0,a]$.

In Table 1, four tests are compared: the test based on $T_n$, the test proposed in Hall and van
Keilegom (2005) (in the sequel referred to as the HvK test), the Durot test, and the integral
statistic version of the Durot test, where we replace the maximum distance statistic by $T_n$,
defined by (2.5), using Durot's method of approximating the critical value. In this table the tests
are compared on the fixed interval $[0,a] = [0, F_0^{-1}(0.95)]$ (instead of on the random interval
$[0, \mathbb{F}_n^{-1}(0.95)]$, as in Hall and van Keilegom (2005)). In all cases we generated 2000
samples, and also $B = 2000$ bootstrap samples from each original sample.

The simulations for the test of Hall and van Keilegom (2005) took rather long, since repeated density
estimation is needed at each step in view of the needed calibration of the bandwidth to create a
nondecreasing hazard in the original sample. Also, one has to compute an estimator of the
distribution function, the density, and the derivative of the density to check whether one gets a
nondecreasing hazard on the chosen interval at the critical bandwidth. The estimation of the density
and its derivative was sped up by using the Fast Fourier Transform, and the distribution function
was computed by numerically integrating the density estimate.

It is seen from Table 1 that the test based on $T_n$ is slightly more powerful for the alternatives
$F^{(d)}$ with $d \in [-1, -0.5]$ than the HvK test. Table 2 shows that the HvK test is rather
anti-conservative. This seems to suggest that the high power of the HvK test in the region
$d \in [-0.5, 0]$ is at least partly due to the anti-conservative behavior of this test. The Durot
test is very conservative for this interval, as is to be expected, since the estimated critical value
is based on the exponential distribution. The test based on $T_n$ has a middle position: it is more
conservative than the HvK test but less conservative than the Durot test. A graphical comparison of
the power functions is given in the left panel of Figure 4.

Interestingly, the power of the Durot test increases considerably if we take a smaller interval
$[0, F_0^{-1}(0.80))$. In fact, the Durot test is derived under the assumption that not all order
statistics belong to the interval $[0,a]$. But this often happens if we take
$[0,a] = [0, F_0^{-1}(0.95))$, in particular for the "bootstrap samples".

Another reason for the higher power of the Durot test in this situation is the fact that the isotonic
projection of the hazard $h^{(d)}$, for $d \in [-1,0]$, is almost constant on the interval
$[0, F_0^{-1}(0.8))$, since we miss the steeply increasing part of the hazard from $F_0^{-1}(0.8)$ to
$F_0^{-1}(0.95)$, so sampling from the isotonic projection is almost the same as sampling (locally)
from an exponential distribution in this case.

The results are shown in Tables 3 and 4, and the right panel of Figure 4. If one chooses this
interval, the HvK test is very powerful, but also very anti-conservative. For example, for $d = 0$
(which belongs to the null hypothesis region) one gets an estimated rejection probability of more
than 25% instead of the desired 10%.

Table 1: Estimated powers for model (2.7), where $\alpha = 0.1$, $n = 50$, and
$d = -1, -0.9, \ldots, -0.1$. The estimation interval is $[0, F_0^{-1}(0.95)]$.

d             -1      -0.9    -0.8    -0.7    -0.6    -0.5    -0.4    -0.3    -0.2    -0.1
T_n           0.869   0.699   0.547   0.408   0.323   0.234   0.195   0.152   0.125   0.112
HvK           0.833   0.636   0.467   0.361   0.297   0.234   0.200   0.183   0.152   0.151
Durot         0.042   0.029   0.024   0.021   0.016   0.018   0.015   0.018   0.015   0.017
Durot, T_n    0.258   0.162   0.111   0.057   0.040   0.028   0.022   0.015   0.009   0.004


Table 2: Estimated rejection probabilities for model (2.7) under the null hypothesis, where
$\alpha = 0.1$, $n = 50$, and $d = 0, 0.1, \ldots, 1$. The estimation interval is
$[0, F_0^{-1}(0.95))$.

d             0       0.1     0.2     0.3     0.4     0.5     0.6     0.7     0.8     0.9     1.0
T_n           0.097   0.097   0.080   0.0896  0.086   0.072   0.076   0.077   0.081   0.075   0.071
HvK           0.146   0.138   0.132   0.130   0.124   0.122   0.110   0.103   0.102   0.099   0.110
Durot         0.021   0.019   0.018   0.013   0.015   0.018   0.021   0.024   0.015   0.018   0.021
Durot, T_n    0.003   0.003   0.004   0.001   0.002   0.003   0.002   0.001   0.001   0.001   0.000


Table 3: Estimated powers for model (2.7), where $\alpha = 0.1$, $n = 50$, and
$d = -1, -0.9, \ldots, -0.1$. The estimation interval is $[0, F_0^{-1}(0.8)]$.

d             -1      -0.9    -0.8    -0.7    -0.6    -0.5    -0.4    -0.3    -0.2    -0.1
T_n           0.880   0.726   0.569   0.433   0.332   0.246   0.204   0.163   0.140   0.127
HvK           0.965   0.864   0.766   0.686   0.544   0.483   0.434   0.326   0.299   0.279
Durot         0.645   0.524   0.399   0.303   0.231   0.168   0.127   0.097   0.080   0.075
Durot, T_n    0.742   0.569   0.395   0.253   0.181   0.121   0.067   0.063   0.038   0.030


It is also of interest to compare the powers of the procedure, based on bootstrapping from a
penalized and smoothed version of the hazard, with the powers obtained by just bootstrapping
from the isotonic piecewise constant hazard estimate without any smoothing or penalizing. This
is done in Figure 5, where it is seen that the difference is not very large for this family (and this
sample size). The general trend is that bootstrapping from the isotonic estimate itself gives more
conservative critical values.

Figure 4: The estimated power curves for the family $\{F^{(d)} : d \in [-1,1]\}$, for the isotonic
test statistic $T_n$, defined by (2.5) (red), for the HvK test (blue), the Durot test (green), and
the integral statistic version of this method (black). The sample size is $n = 50$ and the
estimation interval is $[0, F_0^{-1}(0.95)]$ in the left panel, $[0, F_0^{-1}(0.8)]$ in the right
panel.

Table 4: Estimated rejection probabilities for model (2.7) under the null hypothesis, where
$\alpha = 0.1$, $n = 50$, and $d = 0, 0.1, \ldots, 1$. The estimation interval is
$[0, F_0^{-1}(0.8)]$.

d             0       0.1     0.2     0.3     0.4     0.5     0.6     0.7     0.8     0.9     1.0
T_n           0.101   0.103   0.101   0.102   0.096   0.094   0.087   0.091   0.085   0.073   0.074
HvK           0.256   0.229   0.192   0.188   0.170   0.139   0.145   0.132   0.121   0.131   0.112
Durot         0.060   0.047   0.043   0.037   0.027   0.029   0.037   0.034   0.028   0.026   0.025
Durot, T_n    0.024   0.019   0.016   0.009   0.013   0.009   0.005   0.006   0.006   0.004   0.004

In Hall and van Keilegom (2005) the model where the hazard function is of the form

$$ h_{\beta,\gamma,\mu,\sigma}(x) = \exp\left[\gamma x + \beta\,(2\pi\sigma^2)^{-1/2} \exp\{-(x - \mu)^2/(2\sigma^2)\}\right] \tag{3.17} $$

is also studied. Typical members of the family are shown in Figure 6. For this family $T_n$ also
seems to provide the most "all-round" test, since it is more powerful than the HvK test for the
global alternatives, where $\gamma < 0$ (note that the hazard is globally decreasing for these
alternatives) and more powerful for detecting the local disturbances where $\gamma \ge 0$ than the
Gijbels and Heckman (2004) and Proschan and Pyke (1967) tests. Note that the HvK test gives rejection
probabilities which are all above the 10% level in the null hypothesis region. For the exponential
distribution ($\beta = \gamma = 0$) the rejection probability is even close to 50%! The test based on
$T_n$ also gives a rejection probability which is too high here. This is probably caused by boundary
effects and could possibly be remedied by adding heavier penalties in the cusum diagram at the
beginning and end of the interval which is considered. The test was computed for the interval
$[0, F_0^{-1}(0.95)]$.

Figure 5: The estimated power curves for the family $\{F^{(d)} : d \in [-1,1]\}$ and the isotonic
test statistic $T_n$, defined by (2.5), for critical values estimated by bootstrapping from a
penalized and smoothed isotonic estimate (red) and for critical values estimated by bootstrapping
from the isotonic estimate itself (blue). The sample size is $n = 50$ and the estimation interval is
$[0, F_0^{-1}(0.95)]$ in the left panel, $[0, F_0^{-1}(0.8)]$ in the right panel.

The test based on $T_n$ also has higher power for this family than the Durot test, except when
$\gamma = 0$. When $\gamma = 0$ (and $\beta = 0.3$) the hazard is constant except for a local bump
(see Figure 6), so the isotonic projection is the constant hazard. Since the critical values in the
Durot test are specifically based on a (locally) constant hazard, its behavior in this situation is
not surprising, because resampling from the exponential distribution is in this case almost the same
as resampling from the isotonic projection of the real hazard.

4 Appendix 

Figure 6: The hazard function $h_{\beta,\gamma,\mu,\sigma}$. The solid line in the left panel
corresponds to $\beta = 0.3$, $\gamma = 0$, $\mu = 1$ and $\sigma = 0.2$; the dashed line to
$\beta = 0.3$, $\gamma = -0.5$, $\mu = 1$ and $\sigma = 0.1$. In the right panel the solid line
corresponds to $\beta = 0.3$, $\gamma = 0.5$, $\mu = 1$ and $\sigma = 0.1$; the dashed line to
$\beta = 0.3$, $\gamma = 0.5$, $\mu = 1$ and $\sigma = 0.2$.

Table 5: Estimated rejection probabilities for model (3.17), for $\alpha = 0.1$ and $n = 50$. The
numbers in italics are rejection probabilities under the null hypothesis. The values for the HvK, PP
and GH tests were taken from Table 1, p. 1124, in Hall and van Keilegom (2005).

Parameter                      Test     γ = -0.5   -0.25     0         0.5       1
β = 0                          T_n      1.000      0.792     0.213     0.050     0.076
                               HvK      0.844      0.525     0.437     0.189     0.121
                               Durot    0.704      0.307     0.096     0.031     0.028
                               PP       1.00       0.800     0.100     0.000     0.000
                               GH       0.983      0.416     0.100     0.034     0.027
σ = 0.1, μ = 1, β = 0.3        T_n      0.985      0.549     0.229     0.497     0.585
                               HvK      0.675      0.753     0.772     0.656     0.508
                               Durot    0.501      0.417     0.320     0.182     0.107
                               PP       0.997      0.458     0.019     0.000     0.000
                               GH       0.962      0.291     0.178     0.176     0.154
σ = 0.2, β = 0.3               T_n      0.991      0.605     0.172     0.214     0.216
                               HvK      0.715      0.714     0.663     0.443     0.277
                               Durot    0.545      0.346     0.218     0.090     0.045
                               PP       0.999      0.588     0.053     0.000     0.000
                               GH       0.968      0.301     0.114     0.065     0.054

Proof of Lemma 2.1: Let $E_1, E_2, \ldots, E_n$ be an i.i.d. sequence of standard exponential random
variables. Define

$$ X_i = H^{-1}(E_i), \qquad Y_i = H_{a,b}^{-1}(E_i) \qquad \text{for } 1 \le i \le n. $$

Then the $X_i$'s and the $Y_i$'s are samples from the distributions with cumulative hazard $H$ and
$H_{a,b}$ respectively. Denote by $U_n$ the test statistic (2.5) based on the $Y_i$'s and by $V_n$ the
statistic based on the $X_i$'s. Furthermore, define the function $\phi : [a,b] \to [a,b]$ by
$\phi(x) = H_{a,b}^{-1}(H(x))$. Note that $\phi$ is convex and increasing on $[a,b]$ and that
$Y_i = \phi(X_i) \le X_i$ for all $i$. Moreover, using obvious notation,

$$ \mathbb{F}_n^X(x) = \frac1n \#\{i : X_i \le x\} = \frac1n \#\{i : \phi(X_i) \le \phi(x)\} = \frac1n \#\{i : Y_i \le \phi(x)\} = \mathbb{F}_n^Y(\phi(x)). $$

Consequently, also $\mathbb{H}_n^X(x) = \mathbb{H}_n^Y(\phi(x))$, where these functions refer to the
empirical cumulative hazards based on the samples of $X_i$'s and $Y_i$'s respectively. Now define
$\bar H_n(x) = \hat H_n^Y(\phi(x))$, where the latter denotes the greatest convex minorant of the
empirical cumulative hazard function based on the $Y_i$'s, evaluated at $\phi(x)$. Then $\bar H_n$ is
a minorant of $\mathbb{H}_n^X$, i.e.,

$$ \bar H_n(x) = \hat H_n^Y(\phi(x)) \le \mathbb{H}_n^Y(\phi(x)) = \mathbb{H}_n^X(x). $$

Moreover, it is also convex. Indeed, using monotonicity and convexity of $\hat H_n^Y$ and convexity
of $\phi$, we have for $\alpha \in (0,1)$ and $x, y \in [a,b]$

$$ \begin{aligned} \bar H_n(\alpha x + (1-\alpha) y) &= \hat H_n^Y(\phi(\alpha x + (1-\alpha) y)) \le \hat H_n^Y(\alpha \phi(x) + (1-\alpha)\phi(y)) \\ &\le \alpha \hat H_n^Y(\phi(x)) + (1-\alpha) \hat H_n^Y(\phi(y)) = \alpha \bar H_n(x) + (1-\alpha) \bar H_n(y). \end{aligned} $$

Hence, the convex minorant $\bar H_n$ of $\mathbb{H}_n^X$ is smaller than or equal to the greatest
convex minorant $\hat H_n^X$ of $\mathbb{H}_n^X$:

$$ \bar H_n(x) \le \hat H_n^X(x) \le \mathbb{H}_n^X(x) \quad \Longrightarrow \quad \mathbb{F}_n^X(x) - \hat F_n^X(x) \le \mathbb{F}_n^X(x) - \bar F_n(x), $$

where we use the obvious notation relating cumulative hazards to distribution functions. This
implies that

$$ \begin{aligned} U_n &= \int_{[a,b]} \left(\mathbb{F}_n^Y(x-) - \hat F_n^Y(x)\right) d\mathbb{F}_n^Y(x) = \int_{[a,b]} \left(\mathbb{F}_n^Y(\phi(x)-) - \hat F_n^Y(\phi(x))\right) d\mathbb{F}_n^Y(\phi(x)) \\ &= \int_{[a,b]} \left(\mathbb{F}_n^X(x-) - \bar F_n(x)\right) d\mathbb{F}_n^X(x) \ge \int_{[a,b]} \left(\mathbb{F}_n^X(x-) - \hat F_n^X(x)\right) d\mathbb{F}_n^X(x) = V_n. \end{aligned} $$

Noting that $P_H\{T_n > t\} = P\{V_n > t\}$ and $P_{H_{a,b}}\{T_n > t\} = P\{U_n > t\}$, the result
follows. □

Proof of Theorem 2.3. The result follows from Theorem 2.1, if we can show that the estimate
$\tilde H_n$, which generates the bootstrap samples, has the property that the corresponding
estimates $\tilde f_n$, $\tilde h_n$ and $\tilde h_n'$ of $f_0$, $h_0$ and $h_0'$, respectively, will
be consistent (in an almost sure sense), since in this case the integrals

$$ \int_a^b \left(\frac{2 \tilde h_n(t) \tilde f_n(t)}{\tilde h_n'(t)}\right)^{1/3} \tilde f_n(t)\,dt \tag{4.18} $$

and

$$ \int_a^b \left(\frac{2 \tilde h_n(t) \tilde f_n(t)}{\tilde h_n'(t)}\right)^{4/3} \tilde f_n(t)\,dt \tag{4.19} $$

will converge (almost surely) to the integrals defining (2.9) and (2.10). Moreover, the consistency
will ensure that the derivative $\tilde h_n'$ will stay away from zero for large $n$, implying that
the distribution generating the bootstrap samples will satisfy the conditions of Theorem 2.1 for
large $n$, and hence that the asymptotic normality result also holds for the test statistics
computed for the bootstrap samples (with parameters $\mu^*$ and $\sigma^*$, derived from (4.18) and
(4.19)).

But the uniform consistency of the estimates $\tilde h_n$ and $\tilde f_n$ on $[a,b]$ is proved in
Groeneboom and Jongbloed (2011b) (here we also use the penalization of $\hat h_n$), and the
consistency of $\tilde h_n'$ on the interior of $[a,b]$ is ensured by the choice of the bandwidth
$cn^{-1/4}$. Since this choice of bandwidth also ensures that the right limit of $\tilde h_n'$ at $a$
and the left limit of $\tilde h_n'$ at $b$ will be positive for all large $n$, we indeed have:

$$ \int_a^b \left(\frac{2 \tilde h_n(t) \tilde f_n(t)}{\tilde h_n'(t)}\right)^{1/3} \tilde f_n(t)\,dt \ \stackrel{a.s.}{\longrightarrow}\ \int_a^b \left(\frac{2 h_0(t) f_0(t)}{h_0'(t)}\right)^{1/3} f_0(t)\,dt $$

and

$$ \int_a^b \left(\frac{2 \tilde h_n(t) \tilde f_n(t)}{\tilde h_n'(t)}\right)^{4/3} \tilde f_n(t)\,dt \ \stackrel{a.s.}{\longrightarrow}\ \int_a^b \left(\frac{2 h_0(t) f_0(t)}{h_0'(t)}\right)^{4/3} f_0(t)\,dt, $$

and the result now follows. □



Remark 4.1 In order to get a good estimate of the critical value, one wants to choose a small
bandwidth in estimating the hazard function for the bootstrap samples, in order to minimize the
bias. However, since one also wants to estimate the derivative $h_0'$ consistently on the interval
$[a,b]$, the bandwidth cannot be too small. As an example, the choice $b_n = n^{-1/3}$ is too small
for this purpose. This motivated the choice of a bandwidth of order $n^{-1/4}$, but other choices
are also possible. Silverman (1978) gives the necessary and sufficient condition

$$ \frac{\log(1/b_n)}{n b_n} \to 0 $$

for uniform consistency of a kernel estimate of the density in ordinary density estimation (see his
Theorem C, p. 182), where $b_n$ again denotes the bandwidth.